Combining independent modules in lexical multiple-choice problems
نویسندگان
چکیده
Existing statistical approaches to natural language problems are very coarse approximations to the true complexity of language processing. As such, no single technique will be best for all problem instances. Many researchers are examining ensemble methods that combine the output of multiple modules to create more accurate solutions. This paper examines three merging rules for combining probability distributions: the familiar mixture rule, the logarithmic rule, and a novel product rule. These rules were applied with state-of-the-art results to two problems used to assess human mastery of lexical semantics — synonym questions and analogy questions. All three merging rules result in ensembles that are more accurate than any of their component modules. The differences among the three rules are not statistically significant, but it is suggestive that the popular mixture rule is not the best rule for either of the two problems.
منابع مشابه
Combining Independent Modules to Solve Multiple-choice Synonym and Analogy Problems
Existing statistical approaches to natural language problems are very coarse approximations to the true complexity of language processing. As such, no single technique will be best for all problem instances. Many researchers are examining ensemble methods that combine the output of successful, separately developed modules to create more accurate solutions. This paper examines three merging rule...
متن کاملThe Effects of Task Orientation and Involvement Load on Learning Collocations
This study examined the effects of input-oriented and output-oriented tasks with different involvement load indices on Iranian EFL learners' comprehension and production of lexical collocations. To achieve this purpose, a sample of 180 intermediate-level EFL learners (both male and female) participated in the study. The participants were in six experimental groups. Each of the groups was random...
متن کاملA Hybrid Machine Translation System Based on a Monotone Decoder
In this paper, a hybrid Machine Translation (MT) system is proposed by combining the result of a rule-based machine translation (RBMT) system with a statistical approach. The RBMT uses a set of linguistic rules for translation, which leads to better translation results in terms of word ordering and syntactic structure. On the other hand, SMT works better in lexical choice. Therefore, in our sys...
متن کاملAddressing Problems across Linguistic Levels in SMT: Combining Approaches to Model Morphology, Syntax and Lexical Choice
Morphological complexity • Data sparsity due to uncovered inflected forms • Difficulty to produce the correct target-side inflection based on available information COMBINING APPROACHES • Pre-processing – syntactic level Source-side reordering (Gojun and Fraser, 2012) • At decoding time – lexical level Discriminative classifier to score translation rules using source-side context (Tamchyna et al...
متن کاملBuilding Another Bridge over the Generation Gap
In this paper, we address one of the central problems in text generation: the missing link ("the generation gap" in Meteer's terms) between the global discourse organization as often provided by text planning modules and the linguistic realization of this organization. We argue that the link should be established by the lexical choice proces s using resources derived from Mel'~uk's Lezical Func...
متن کامل